MENLI: Robust Evaluation Metrics from Natural Language Inference
Authors
Abstract
Recently proposed BERT-based evaluation metrics for text generation perform well on standard benchmarks but are vulnerable to adversarial attacks, e.g., relating to information correctness. We argue that this stems (in part) from the fact that they are models of semantic similarity. In contrast, we develop evaluation metrics based on Natural Language Inference (NLI), which we deem a more appropriate modeling. We design a preference-based adversarial attack framework and show that our NLI-based metrics are much more robust to the attacks than the recent BERT-based metrics. On standard benchmarks, our NLI-based metrics outperform existing summarization metrics, but perform below SOTA MT metrics. However, when combining existing metrics with our NLI metrics, we obtain both higher adversarial robustness (15%–30%) and higher quality metrics as measured on standard benchmarks (+5% to 30%).
Similar resources
Natural Language Inference from Multiple Premises
We define a novel textual entailment task that requires inference over multiple premise sentences. We present a new dataset for this task that minimizes trivial lexical inferences, emphasizes knowledge of everyday events, and presents a more challenging setting for textual entailment. We evaluate several strong neural baselines and analyze how the multiple premise task differs from standard tex...
Natural language directed inference from ontologies
This paper presents an investigation into the problem of content determination in natural language generation (NLG), using as an example the problem of determining what to say when asked “What is an A?”, where A is a concept defined in an OWL ontology. It is shown that a naive approach to this problem, which just presents a set of the stated axioms, will often inadvertently violate maxims of co...
Robust Natural Language Analysis
Our basic goal is the development of more robust systems for extracting information from natural language text. A robust system is one which is able to extract at least partial information despite the presence of ill-formed or unexpected syntactic, semantic, or discourse structures. Our approach has two aspects: First, we incorporate a rich set of syntactic, semantic, and discourse constraints,...
Natural logic and natural language inference
We propose a model of natural language inference which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We extend past work in natural logic, which has focused on semantic containment and monotonicity, by incorporating both semantic exclusion and implicativity. Our model decomposes an inference problem into a sequence of atomic edits lin...
Natural Language Inference in Coq
In this paper we propose a way to deal with natural language inference (NLI) by implementing Modern Type Theoretical Semantics in the proof assistant Coq. The paper is a first attempt to deal with NLI and natural language reasoning in general by using the proof assistant technology. Valid NLIs are treated as theorems and as such the adequacy of our account is tested by trying to prove them. We use...
Journal
Journal title: Transactions of the Association for Computational Linguistics
Year: 2023
ISSN: 2307-387X
DOI: https://doi.org/10.1162/tacl_a_00576